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From the entire population (N— 341 ) or grades 7 and 8 
in a rural Indiana junior high school, 160 subjects were randomly 
selected and assigned to the experimental and the control groups. 

Form A of the Kelson Reading Test was administered twice with a 
4-week interval. While the control group was told only that the 
post-test was given to measure how much they had learned since the 
last testing, the experimental group was told that those making any 
performance improvement would win material rewards. Those making the 
most increase received one of six transistor radios; others received 
a university sweatshirt or a 25-cent candy bar. Results were analyzed 
using analysis of covariance. Two major dependent variables were the 
total number correct (NC) and the total number of items attempted 
(KA) . post-test means were adjusted for pretest differences between 
the two groups. The effect of the reward treatment was found to be 
significant in all instances on the NC variable <p<-01) and on the NA 
variable (p<.001). Results for the vocabulary and comprehension 
subtist ytri also imported* It was indicatii that th6 ©xporiinontal 
group outperformed the control group by more than 3 months in total 
means. Tables and references are included. (AM) 
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Rationale 



In the liter at lore surrounding the accountability boom, the 
issu e of raisins performance by offering material rewards to students 
is discussed frequently. Two cases must be distinguished here. 

First, there is the possibility to entice students to learn more, 
to study harder, etc. in order to obtain a prise of some kind. 

Though in a sense achievement gains in this case may he said to 
be due to manipulation of factors extrinsic to learning, the .. 
distinguishing feature of this situation is that high post instruc- 
tional test scores reflect a genuine increase in the skills or 

understandings tapped by the test in question. 

The second case is characterised hy absence of such learning. 
Here, final test scores are higher than initial best scores merely 
because of promises of material rewards just prior to taking the test, 
in this situation achievement scores are raised while it must be 
assumed that no learning occurred between initial and final measures. 

In instructional settings, whenever rewards are available for 
performance on some post instructional measure, a confounding of 
both cases exists. It might he hypothesized that performance 
increases on such a measure may have their source 1) in actual 
learning, whether or not induced hy the awaiting rewards, 2> in 
an increase of energy and coneentraticn on the post test and 
3)ifc a blend of l) and 2); 

while the above analysis .seems 18 kn °* 
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about whether or not post test scores actually can be boosted merely 
by offering material rewards. The question central to the 
present study therefore is* Can a significant increase in test 
scores be induced by offering material rewards in absence of 
learning? 

Tc view of the increasing emphasis on accountability this 
question seems to have an undeniable urgency* 

A second question ashed in this study* related to the first one* 
is whether such rewards would lead to a higher number of items 
attempted. This question is of interest since in many cases time 
limits for standardised reading tests prohibit attempting all items 
under standard levels of motivation. 

Method 

Materials 

The achievement measure used in this study was form A of the 
Nelson Reading Test. (Nelson* 3 . 962 ) This test consists of two 
parts: vocabulary (100 iteme) and Comprehension (73 items). The 

rationale for selecting a test where speed is a relatively large 
factor is the assumption that increases in energy on the students' 
part might be most directly reflected in increases in number 
of the items attempted. 

Subjects 

Tu this study a Pre-post randomized group design vas utilized. 
The accessible population was defined as the entire student body 
in the 7th and 8th grade of a Junior High School in a rural area 
of Southern Indiana (N=34l) . From this population the experimental 
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s ample (nrfjfiO) WS selected randomly, using a random number table, 
Subjects were then assigned to the Experimental Group and a 
Control Group by the same random process, (Through an oversight 
the E-group contained 8l Ss and the C-group F9)» 



procedure 

At the first testing all Ss in the accessible population were 
tested. Testing took place in regular class sessions. All 3^1 
students were tested in order to increase the information returns 
to the school system. By the method mentioned above, the Ss to be 
reta in ed for post besting (B and C groups) were selected. After 
an interval of k weeks these Bs were post-tested. The C-group 
was post tested in its entirety in the school cafeteria during the 
first school period in the morning. Their instructions were simply 
that they had to take the test again in order to find out how 
much they had learned si nee the last testing. 

Immediately following the testing of the C-group, the E-group 
was tested. The treatment for this group consisted of the availability 
of .performance rewards ; 6 transitor radios (at &.95), 9 sweatshirts 

with the emblem of nearby Indiana University a£d a supply of 25 cent 

candy bars. The directions for this group were: 

nV few w-eks ago you took a test and we have now scored your 
At Se wutfiLe to have you try to heat your first test 
seore. We have some prizes for you If you can do better, on this 
than you did on the last test. We don't care who makes the ^ 
highest score. We want to see who can make more 12* ”“ 

£ft than on the first test. All you need to 

your own score by as many points as you can. The students 
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who raise their scores the mos t will win prises. Remember , 
it's not the student that scores highest that wins; it 7 s 
the student tho raises his score the most . 

If you are one of the top 6 students who gain the most points 

you will win a transitor radio NOT the 6 highest scores 

but the 6 who impiO/s the most. 

If you are one of the next 9 students who raise their score 
the most pilnts, you win win one of these ItJ shirts. 

And if you raise your score even as little as one point, you 
get one of these candy bars. So everyone who gets even Just 
one point higher will get a prize. 

If you do not score any higher on this test than the first 
test there are no prizes, 

Tc*s not hard. Just try to beat your ©wn score. Compete with 
no one but yourself , All you have to do is work harder and 
try to do more items and more of them correctly than you did 
last time. Raise your score and take home one of the prizes," 



Analysis 

The results were analyzed using analysis of covariance, with 
pre-test scores as the covariate in the case of all dependent 
variables. Two major dependent variables were subjected to analysis 
the total, numbers correct, (itfC) and the total number of items 
attempted (NA), Results for the vocabulary and comprehension 
subtests are also reported. 



Results 

Table 1 summarizes the descriptive data for the number correct 
(NC) dependent variable. Since the slopes of the within group 
repression lines were not significantly different, in fact, they 
were nearly identical, ANCOVA was used as planned. Table 2 shows 
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post test means adjusted f'or pretest differences between the experi- 
mental and control group mean scores. As was to be expected the 
coefficients of homogeneity were quite high (Table 3) matching 
those reported by the test publisher. It may be mentioned that the 
stability coefficients (product moment correlation between pre and 
post- test scores) were somewhat lower. For -vocabulary, comprehension 
end total scores these coefficients were for the Experimental group 
,82, ,7k and ,85 respectively; for the Control 8s these correlations 
were .73, .84 and .03. 

In Tables 4 through 6 the results of the AM? OVA on the NC 
variable are reported. The effect of the treatment is significant 
in all instances. The probability that differences on the total 
score variable could have arisen by chance rather than as a 
consequence of experimental manipulation is smaller than . 01. 

As was discussed above, it was assumed that the Experimental C# 
would attempt more items than the Control group. It must be pointed 
out that a test on the difference between mean number of items 
attempted is not a direct test on the hypothesis that the increased 
number correct is a function of the increase in numbers attempted. 

The notion that increase in speed led to more errors hut that these 
errors were offset by an increase in concentration on the experimental 
subjects part was not tested in this experiment , The correlation 
coefficients in Table 7 seem to suggest a careful interpretation 
of the relation between number of attempts and number of items 
correct. 




O 



-b- 

Table 8 summarizes the descriptive data for the dependent 
variable Number of items Attempted (MA)- In fable 9 post-test 
means adjusted for pre-test differences are presented. 

From Tables 10-12 it can be seen that the differences between 
treatment means when HA is the dependent variable, is significant 
at the .001 level. The experimental group attempted significantly 
more items that the control group. 

Table 13 summarizes the ANCOVA’s and indicates the level of 
significance of the various effects . 

Discussion 

The data presented above indicates unequivocally that it is 
possible to raise test scores merely by presenting material rewards. 

Again it must be pointed out that tihe differences between 
experimental and control group means cannot be explained by reference 
to any learning. 

Here is a case where differences not only have statistical 
significance but practical significance as well. According to the 
table of norms in the publisher’s manual, the experimental group 
out performed the control group by more than 3 months (based on 
adjusted total means. Table 2), 

From the point of view of performance contracting business, 
the results obtained are quite interesting. Figure 1 shows a 
table of payments which is based on a real contract ( ) * In 



this contract gains per ,year teaching are expressed in portions 




of a year. For the purpose of this study, the authors used the same 



base for payments but substituted the unit "year*' for "month”, 
since the experiment lasted only 4 ■weeks. On this basis the author 
"earned" 8 L 5 ,, 260 , if no penalty was enforced for students who dropped 
back, or $2,86 o with such a penalty, Kb small return for an investment 
of six radios, nine sweaters and some candy, or $75 in all. 




Tnble 1 



Means and Standard Deviations for the Experimental 
(n«8l) and the Control Groups, (n=79) 





(Number 


* correct is 


dependent 


variable) 










Pre-test 






Post-test 






Voeab. 


Compre , 


Total 


Vocab, 


Compre , 


Total 


Experimental 

(n=8l) 


56,20 

(10,56) 


42,00 

(11,02) 


98,20 

(20,73) 


62.37 

(11.88) 


50.62 

(12.04) 


112. 99 

(22.91) 


Control 

(n=?9) 


54.91 

(11.06) 


(40.46) 

(11.42) 


95.37 

(21,08) 


(58,71) 

(12.69) 


46.87 

(13.07) 


105.60 

(24.91) 









Means and Standard Errors on the Post-test adjusted for Pre-test 
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Control 59.39 1*7.65 107.13 

(.55) (.69) 3 (.99) 



Table 3 

Reliab il ity Coefficients and Standard Errors of Measurement 
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Table 7 



Pearson Product Moment Correlations between 
Humber Correct and Number Attempted Scores 





Pre-test 

Voc. 


Compreh, 


Total 


Voe. 


Post-test 
Compreh. Total 




Experimental 


— 


.83 


.88 


,88 


.75 


,w 




(n— 8l) 
















Control 


.76 


• 

CO 


.78 


.90 


.84 


• 

CO 

VO 




(n-79) 
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AMC0VA for Vocabulary Post Test Scores, 




* 




ANCOVA for Total Beading Post Test Score, 
scores as covariate. (Number attempted is dependent variable.) 




eric: 



#** p < ,001 



for It ems Correct and Items Attempted Post Test mean scores. 




Pay 



Table of Payments 



Schedule of Payments; 



Gain 




0,10 — 


,19 mths 


0,20 — 


.29 mths 


0.30 — 


,39 mths 


0,40 — 


*i* mths 



aioo.oo 

6130.00 

ai4o,oo 

6150.00 



TO be paid to the experimental students after subtraction of mean gam 
of control group from each individual* s gain. 



A 

(without penalty for loss) 

$5260 



B 



(with penalty for loss) 
$2860 
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